(54) Factorial chemical libraries



(57) A method and library for determining the sequence of monomers in a polymer which is complementary
to a receptor. The method provides for formation of pooled (6) and separate (10, 12) products. Separate
products are subjected only to subsequent pooled coupling steps. Each pooled product is subsequently divided
for formation of pooled and separate products. The resulting polymer library includes groups of polymer
products. A first group of products (42) is used to identify the monomer at a first location in a polymer that is
complementary to a receptor. A second group of products (44) is used to identify the monomer at a second location
in a polymer that is complementary to a receptor.
Description

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to the field of polymer screening. More specifically, in one embodiment the
invention provides an improved polymer library and method of using the library to identify a polymer sequence that is 
complementary to a receptor. 

[0002] Many assays are available for measuring the binding affinity of receptors and ligands, but the information
which can be gained from such experiments is often limited by the number and type of ligands which are available.
Small peptides are an exemplary system for exploring the relationship between structure and function in biology. When
the twenty naturally occurring amino acids are condensed into peptides they form a wide variety of three-dimensional
configurations, each resulting from a particular amino acid sequence and solvent condition. The number of possible
pentapeptides of the 20 naturally occurring amino acids, for example, is 20^5 or 3.2 million different peptides. The
likelihood that molecules of this size might be useful in receptor-binding studies is supported by epitope analysis studies
showing that some antibodies recognize sequences as short as a few amino acids with high specificity.
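The count quoted above follows from raising the basis set size to the polymer length. A minimal sketch of that arithmetic is given below; the function name `library_size` is introduced here only for illustration and does not appear in the specification.

```python
def library_size(basis_set_size: int, length: int) -> int:
    """Number of distinct linear polymers of a given length when any
    monomer of the basis set may occupy any position."""
    return basis_set_size ** length


# Pentapeptides of the 20 naturally occurring amino acids.
print(library_size(20, 5))  # 3200000
```

The same function reproduces the library sizes cited for other basis sets and lengths, which is why exhaustive one-at-a-time synthesis quickly becomes impractical.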

[0003] Prior methods of preparing large numbers of different oligomers have been painstakingly slow when used at
a scale sufficient to permit effective rational or random screening. For example, the "Merrifield" method, described in
Atherton et al., "Solid Phase Peptide Synthesis," IRL Press, (1989), incorporated herein by reference for all purposes,
has been used to synthesize peptides on a solid support such as pins or rods. The peptides are then screened to
determine if they are complementary to a receptor. Using the Merrifield method, it is not economically practical to
screen more than a few peptides in a day.

[0004] Similar problems are encountered in the screening of other polymers having a diverse basis set of monomers.
For example, various methods of oligonucleotide synthesis such as the phosphite-triester method and the phosphotriester
method, described in Gait, "Oligonucleotide Synthesis," IRL Press, (1990), incorporated herein by reference
for all purposes, have similar limitations when it is desired to synthesize many diverse oligonucleotides for screening.

[0005] To screen a larger number of polymer sequences, more advanced techniques have been disclosed. For example,
Pirrung et al., WO 90/15070, incorporated herein by reference for all purposes, describes a method of synthesizing
a large number of polymer sequences on a solid substrate using light-directed methods. Dower et al., U.S.
application Serial No. 07/762,522, also incorporated by reference herein for all purposes, describes a method of
synthesizing a library of polymers and a method of use thereof. The polymers are synthesized on beads, for example. A
first monomer is attached to a pool of beads. Thereafter, the pool of beads is divided, and a second monomer is
attached. The process is repeated until a desired, diverse set of polymers is synthesized.
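The divide, couple, and recombine cycle described above can be sketched in a few lines. The following is a generic split-and-pool simulation, not the procedure of the cited application: the function name `split_and_pool`, the bead count, the three-letter basis set, and the use of a seeded shuffle to model mixing are all assumptions made for illustration.

```python
import random


def split_and_pool(n_beads: int, basis_set: str, rounds: int, seed: int = 0) -> list[str]:
    """Simulate split-and-pool synthesis; each bead carries one sequence."""
    rng = random.Random(seed)
    beads = [""] * n_beads
    for _ in range(rounds):
        rng.shuffle(beads)  # pool and mix all beads
        # Divide the pool into one portion per monomer.
        portions = [beads[i::len(basis_set)] for i in range(len(basis_set))]
        # Couple a single monomer to every bead in each portion, then recombine.
        beads = [seq + monomer
                 for monomer, portion in zip(basis_set, portions)
                 for seq in portion]
    return beads


beads = split_and_pool(n_beads=2000, basis_set="ABC", rounds=3)
print(len(beads), "beads carrying", len(set(beads)), "distinct sequences")
```

Each bead ends up with exactly one sequence, while the pool as a whole samples the combinatorial space of the basis set.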

[0006] Other methods of synthesizing and screening polymers have also been proposed. For example, Houghten
et al., "Generation and Use of Synthetic Peptide Combinatorial Libraries for Basic Research and Drug Discovery,"
Nature (1991) 354:84-86, discuss a method of generating peptide libraries that are used for screening the peptides for
biological activity (see also, Houghten et al., "The Use of Synthetic Peptide Combinatorial Libraries for the Identification
of Bioactive Peptides," Peptide Research (1992) 5:351-358). Houghten synthesized a peptide combinatorial library
(SPCL) composed of some 34 x 10^6 hexapeptides and screened it to identify antigenic determinants that are recognized
by a monoclonal antibody. Furka et al., "General Method for Rapid Synthesis of Multicomponent Peptide Mixtures,"
Int. J. Peptide Protein Res. (1991) 37:487-493, discusses a method of synthesizing multicomponent peptide mixtures.
Furka proposed pooling as a general method for the rapid synthesis of multicomponent peptide mixtures and illustrated
its application by synthesizing a mixture of 27 tetrapeptides and 180 pentapeptides. Lam et al., "A new type of synthetic
peptide library for identifying ligand-binding activity," Nature (1991) 354:82-84, used pooling to generate a pentapeptide
bead library that was screened for binding to a monoclonal antibody. Blake et al., "Evaluation of Peptide Libraries: An
Iterative Strategy To Analyze the Reactivity of Peptide Mixtures With Antibodies," Bioconjugate Chem. (1992) 3:
510-513, discusses the screening of presumed mixtures of 50,625 tetrapeptides and 16,777,216 hexapeptides to select
epitopes recognized by specific antibodies.

[0007] Lam's synthetic peptide library consists of a large number of beads, each bead containing peptide molecules
of one kind. Beads that bind a target (e.g., an antibody or streptavidin) are rendered colored or fluorescent. Lam reports
that several million beads distributed in 10-15 petri dishes can be screened with a low-power dissecting microscope
in an afternoon. Positive beads are washed with 8M guanidine hydrochloride to remove the target protein and then
sequenced. The 100-200 µm diameter beads contain 50-200 pmol of peptide, putatively well above their 5 pmol
sensitivity limit. Three pentapeptide beads were sequenced daily. The essence of Lam's method is that the identity of
positive beads is established by direct sequencing.

[0008] Houghten et al. use a different approach to identify peptide sequences that are recognized by an antibody.
Using the nomenclature described herein, Houghten et al. screened an X6X5X4pX3pX2pX1p library and found that the
mixture DVX4pX3pX2pX1p had greatest potency in their inhibition assay. Houghten then synthesized a DVX4X3pX2pX1p
library and identified the most potent amino acid in the third position. After three more iterations, they found that
DVPDYA binds to the antibody with a Kd of 30 nM. The essence of Houghten's method is recursive retrosynthesis, in which
the number of pooled positions decreases by one each iteration.
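The iteration described above, in which one more position is defined on each pass, can be sketched as a simple loop. The code below is a toy model only: `mixture_activity` is a hypothetical stand-in for an inhibition assay (it merely counts defined positions that match a hidden target), and the function names are introduced here for illustration.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 common residues


def mixture_activity(defined_prefix: str, target: str) -> int:
    """Hypothetical assay: score a mixture that is defined at its leading
    positions and pooled elsewhere by how many defined positions match."""
    return sum(a == b for a, b in zip(defined_prefix, target))


def deconvolute(target: str) -> tuple[str, int, int]:
    """Define two positions first, then one further position per iteration."""
    candidates = [a + b for a in AMINO_ACIDS for b in AMINO_ACIDS]
    mixtures_made = len(candidates)
    prefix = max(candidates, key=lambda p: mixture_activity(p, target))
    rounds = 1
    while len(prefix) < len(target):
        candidates = [prefix + a for a in AMINO_ACIDS]
        mixtures_made += len(candidates)
        prefix = max(candidates, key=lambda p: mixture_activity(p, target))
        rounds += 1
    return prefix, rounds, mixtures_made


print(deconvolute("DVPDYA"))  # ('DVPDYA', 5, 480)
```

In this model the hexapeptide is recovered after five rounds of synthesis and screening. Each round after the first requires a fresh synthesis that depends on the previous result, which is the cost the ordered pooled and separate steps of the present library are intended to avoid.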

[0009] Blake et al. used a "bogus coin strategy" to guide them to a preferred amino acid sequence. In this strategy
a basis set of monomers (15 amino acids) is first divided into three groups. Blake et al. chose A, L, V, F, Y (subgroup
α), G, S, P, D, E (subgroup β), and K, R, H, N, Q (subgroup γ). By adjusting the weighting of the subgroups at each
position in the polymer sequence, and then testing the activity of the weighted polymer against an unweighted polymer,
one subgroup was selected for each monomer position in the sequence. In an experiment conducted by Blake et al.,
a complete collection of tetramers X1p X2p X3p X4p was reduced to α1 α2 γ3 α4 by four inhibition experiments. Then
the subgroups α and γ were each further subdivided into three groups of amino acids which were used to synthesize
four more collections of weighted polymers. Inhibition studies with each of these collections suggested an epitope (F
or Y)1 (A or L)2 (K or R)3 (F or Y)4. One more iteration gave the desired epitope FLRF.

[0010] While meeting with some success, prior methods have also met with certain limitations. For example, it is
sometimes desirable to avoid the use of the equipment necessary to conduct light-directed techniques. Also, some
prior methods have not produced the desired amount of diversity as efficiently as would be desired.

[0011] From the above, it is seen that an improved method and apparatus for synthesizing a diverse collection of
chemical sequences is desired.

SUMMARY OF THE INVENTION

[0012] An improved polymer library and method of screening diverse polymers is disclosed. The system produces
libraries of polymers in an efficient manner, and utilizes the libraries for identification of the monomer sequence of 
polymers which exhibit significant binding to a ligand. 

[0013] According to one aspect of the invention, a library of polymers is formed using "pooled" and "unpooled" (or
"separate") coupling steps. In the pooled steps, each of the monomers from a basis set of monomers is coupled to the
terminus of a growing chain of monomers on a plurality of previously mixed solid substrates. The mixed substrates are
divided for coupling of each individual monomer in a basis set. In separate steps, the substrates are not intermixed
from a previous coupling step, and each of the monomers in a basis set is separately coupled to the terminus of a
growing chain of monomers on a plurality of the unmixed substrates.

[0014] According to one preferred aspect of the invention, pooled steps and unpooled steps are ordered such that
a monomer sequence which binds to a receptor can be readily identified from the library. For
example, according to one preferred embodiment of the invention, several groups of products are derived from the
synthesis steps. Each group is used to identify the monomer at a specific position in the polymer chain.

[0015] According to most preferred aspects of the invention, the library is constructed using an ordered series of
coupling steps in which products resulting from a separate step are, thereafter, only subjected to pooled coupling steps.
Products resulting from a pooled coupling step which have not been previously subjected to an unpooled step are
always divided for pooled and unpooled coupling. This ordered series of steps results in a relatively small number of
coupling steps, but still allows for identification of the monomer sequence of a polymer which is complementary to a
receptor of interest. For example, a first group of products is used to identify the monomer at a first location in a polymer
that is complementary to a receptor. A second group of products is used to identify the monomer at a second location
in a polymer that is complementary to a receptor.

[0016] Accordingly, in one embodiment the invention provides a polymer library screening kit. The kit includes
families of polymers X3-X2p-X1p, X3p-X2-X1p, and X3p-X2p-X1 wherein X3p-X2p-X1 comprises a collection of at least first
and second polymer mixtures, the first polymer mixture having a first monomer in a first position of polymer molecules
therein, and different monomers in second and third positions of the polymer molecules therein, and wherein the second
polymer mixture has a second monomer in the first position of polymer molecules therein, and different monomers in
second and third positions of the polymer molecules therein; X3p-X2-X1p comprises a collection of at least third and
fourth polymer mixtures, the third polymer mixture having a third monomer in the second position and the fourth polymer
mixture having a fourth monomer in the second position, each of the third and fourth polymer mixtures having different
monomers in the first and third positions; and X3-X2p-X1p comprises a collection of at least fifth and sixth polymer
mixtures, the fifth polymer mixture having a fifth monomer in the third position and the sixth polymer mixture having a
sixth monomer in the third position, each of the fifth and sixth polymer mixtures having different monomers in the first
and second positions, wherein the first, third, and fourth monomers are the same or different and the second, fourth,
and fifth monomers are the same or different.
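The structure of such a kit can be made concrete by enumerating its contents for a small basis set. The sketch below is illustrative only: the helper `build_families`, the two-monomer basis set "AB", and the zero-based position index used as a dictionary key are assumptions, and each mixture is modeled simply as the set of sequences it contains.

```python
from itertools import product


def build_families(basis_set: str, length: int) -> dict[int, dict[str, set[str]]]:
    """For each position, build one mixture per monomer: the mixture is
    defined at that position and pooled at every other position."""
    families = {}
    for defined_pos in range(length):
        families[defined_pos] = {
            monomer: {"".join(seq)
                      for seq in product(basis_set, repeat=length)
                      if seq[defined_pos] == monomer}
            for monomer in basis_set
        }
    return families


families = build_families("AB", 3)
print(sorted(families[0]["A"]))  # ['AAA', 'AAB', 'ABA', 'ABB']
```

With a basis set of n monomers and polymers of length L, the kit holds n x L mixtures, while together the mixtures of any one family cover all n^L sequences.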

[0017] A method of identifying first and second monomers in a polymer that is complementary to a receptor is also
provided. The method includes the steps of coupling first and second monomers in a first basis set to individual
substrates and mixing substrates to form first pooled products; coupling the first and second monomers from the first basis
set to individual substrates, and not mixing the substrates to form at least first and second separate products; separately
coupling first and second monomers, from a second basis set to substrates from the first pooled products and not
mixing the substrates to form at least third and fourth separate products, the second basis set being the same or
different than the first basis set; coupling the first and second monomers from the second basis set to individual
substrates from the first separate products and mixing the substrates to form second pooled products; coupling the first
and second monomers from the second basis set to individual substrates from the second separate products to form
third pooled products; and exposing a receptor to the third and fourth separate products to identify a second monomer
in a polymer which is complementary to a receptor, and exposing the second and third pooled products to the receptor
to identify a first monomer in a polymer which is complementary to a receptor. A polymer screening technique using
factoring is also disclosed.

[0018] A further understanding of the nature and advantages of the inventions herein may be realized by reference
to the remaining portions of the specification and the attached drawings.

BRIEF DESCRIPTION OF THE DRAWINGS 

[0019]

Figs. 1a and 1b are schematic diagrams of specific embodiments of the invention;

Fig. 2 illustrates a simple reaction graph;

Fig. 3 illustrates a reaction graph with pooled and separate products;

Fig. 4 illustrates a simplified reaction graph;

Figs. 5a, 5b, and 5c illustrate a family of pooled syntheses;

Fig. 6 illustrates a reaction graph for forming the products X3pX2X1p;

Fig. 7 illustrates a reaction graph for all 64 trinucleotides;

Fig. 8 illustrates the synthesis of AAT, TGC, TGT, GTA, GTG, and CCG;

Fig. 9 provides an alternative representation of the invention;

Figs. 10a, 10b, and 10c illustrate a recursive retrosynthesis embodiment of the invention;

Figs. 11a, 11b, and 11c illustrate a combinatorial synthesis chamber of the invention; and

Fig. 12 illustrates a polymer library according to one embodiment of the invention.

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

CONTENTS

[0020]

I. Terminology

II. Overall Description

III. Polynomial Factoring Applied to Screening

IV. Conclusion

I. Terminology

[0021]

Ligand: A ligand is a molecule that is recognized by a particular receptor. Examples of ligands that can be
investigated by this invention include, but are not restricted to, agonists and antagonists for cell membrane receptors,
toxins and venoms, viral epitopes, hormones (e.g., opiates, steroids, etc.), hormone receptors, peptides, enzymes,
enzyme substrates, cofactors, drugs, lectins, sugars, oligonucleotides, nucleic acids, oligosaccharides, proteins,
and monoclonal antibodies.

Monomer: A member of the set of small molecules which are or can be joined together to form a polymer. The set
of monomers includes but is not restricted to, for example, the set of common L-amino acids, the set of D-amino
acids, the set of synthetic and/or natural amino acids, the set of nucleotides and the set of pentoses and hexoses,
as well as subsets thereof. The particular ordering of monomers within a polymer is referred to herein as the
"sequence" of the polymer. As used herein, monomers refers to any member of a basis set for synthesis of a
polymer. For example, dimers of the 20 naturally occurring L-amino acids form a basis set of 400 monomers for
synthesis of polypeptides. Different basis sets of monomers may be used at successive steps in the synthesis of
a polymer. Furthermore, each of the sets may include protected members which are modified after synthesis. The
invention is described herein primarily with regard to the preparation of molecules containing sequences of
monomers such as amino acids, but could readily be applied in the preparation of other polymers. Such polymers
include, for example, both linear and cyclic polymers of nucleic acids, polysaccharides, phospholipids, and peptides
having either α-, β-, or ω-amino acids, heteropolymers in which a known drug is covalently bound to any of the
above, polynucleotides, polyurethanes, polyesters, polycarbonates, polyureas, polyamides, polyethyleneimines,
polyarylene sulfides, polysiloxanes, polyimides, polyacetates, or other polymers which will be apparent upon review
of this disclosure. Such polymers are "diverse" when polymers having different monomer sequences are formed
at different predefined regions of a substrate. Methods of cyclization and polymer reversal of polymers which may
be used in conjunction with the present invention are disclosed in copending application Serial No. 796,727, filed
November 22, 1991 entitled "POLYMER REVERSAL ON SOLID SURFACES," incorporated herein by reference
for all purposes. The "position" of a monomer in a polymer refers to the distance, by number of monomers, from
a terminus or other reference location on a polymer.

Peptide: A polymer in which the monomers are alpha amino acids and which are joined together through amide
bonds, alternatively referred to as a polypeptide. In the context of this specification it should be appreciated that
the amino acids may be the L-optical isomer or the D-optical isomer. Peptides are often two or more amino acid
monomers long, and often more than 20 amino acid monomers long. Standard abbreviations for amino acids are
used (e.g., P for proline). These abbreviations are included in Stryer, Biochemistry, Third Ed., 1988, which is
incorporated herein by reference for all purposes.

Receptor: A molecule that has an affinity for a given ligand. Receptors may be naturally-occurring or manmade
molecules. Also, they can be employed in their unaltered state or as aggregates with other species. Receptors
may be attached, covalently or noncovalently, to a binding member, either directly or via a specific binding
substance. Examples of receptors which can be employed by this invention include, but are not restricted to, antibodies,
cell membrane receptors, monoclonal antibodies and antisera reactive with specific antigenic determinants (such
as on viruses, cells or other materials), drugs, polynucleotides, nucleic acids, peptides, cofactors, lectins, sugars,
polysaccharides, cells, cellular membranes, and organelles. Receptors are sometimes referred to in the art as
anti-ligands. As the term receptors is used herein, no difference in meaning is intended. A "Ligand Receptor Pair"
is formed when two macromolecules have combined through molecular recognition to form a complex.

[0022] Specific examples of receptors which can be investigated by this invention include but are not restricted to:

a) Microorganism receptors: Determination of ligands which bind to receptors, such as specific transport proteins
or enzymes essential to survival of microorganisms, is useful in developing a new class of antibiotics. Of particular value would
be antibiotics against opportunistic fungi, protozoa, and those bacteria resistant to the antibiotics in current use.

b) Enzymes: For instance, the binding site of enzymes such as the enzymes responsible for cleaving
neurotransmitters; determination of ligands which bind to certain receptors to modulate the action of the enzymes which
cleave the different neurotransmitters is useful in the development of drugs which can be used in the treatment of
disorders of neurotransmission.

c) Antibodies: For instance, the invention may be useful in investigating the ligand-binding site on the antibody
molecule which combines with the epitope of an antigen of interest; determining a sequence that mimics an
antigenic epitope may lead to the development of vaccines of which the immunogen is based on one or more of such
sequences or lead to the development of related diagnostic agents or compounds useful in therapeutic treatments
such as for autoimmune diseases (e.g., by blocking the binding of the "self" antibodies).

d) Nucleic Acids: Sequences of nucleic acids may be synthesized to establish DNA or RNA binding sequences.

e) Catalytic Polypeptides: Polymers, preferably polypeptides, which are capable of promoting a chemical reaction
involving the conversion of one or more reactants to one or more products. Such polypeptides generally include
a binding site specific for at least one reactant or reaction intermediate and an active functionality proximate to the
binding site, which functionality is capable of chemically modifying the bound reactant. Catalytic polypeptides and
others are described in, for example, PCT Publication No. WO 90/05746, WO 90/05749, and WO 90/05785, which
are incorporated herein by reference for all purposes.

f) Hormone receptors: For instance, the receptors for insulin and growth hormone. Determination of the ligands
which bind with high affinity to a receptor is useful in the development of, for example, an oral replacement of the
daily injections which diabetics must take to relieve the symptoms of diabetes, and in the other case, a replacement
for the scarce human growth hormone which can only be obtained from cadavers or by recombinant DNA
technology. Other examples are the vasoconstrictive hormone receptors; determination of those ligands which bind to
a receptor may lead to the development of drugs to control blood pressure.

g) Opiate receptors: Determination of ligands which bind to the opiate receptors in the brain is useful in the
development of less-addictive replacements for morphine and related drugs.

Substrate or Solid Support: A material having a surface and which is substantially insoluble in a solution used for
coupling of monomers to a growing polymer chain. Such materials will preferably take the form of small beads,
pellets, disks or other convenient forms, although other forms may be used. A roughly spherical or ovoid shape is
preferred.

Basis Set: A group of monomers that is selected for attachment to a solid substrate directly or indirectly in a given
coupling step. Different basis sets or the same basis sets may be used from one coupling step to another in a
single synthesis.

Synthetic: Produced by in vitro chemical or enzymatic synthesis. The synthetic libraries of the present invention
may be contrasted with those in viral or plasmid vectors, for instance, which may be propagated in bacterial, yeast,
or other living hosts.



Symbols

[0023]

Xi denotes the set of monomer units in reaction round i.
Xij denotes the j'th monomer unit in reaction round i; Xij can be a null (∅).
Si refers to the separated products after reaction round i.
Pi refers to the pooled products of round i and all preceding rounds.
Xip denotes the pooling of reactants of round i only.

Reaction Graphs 

[0024] A filled circle • denotes a reaction product terminating in a particular monomer unit Xij. The set of reaction
products terminating in Xi is shown by a set of circles on the same horizontal level.

[0025] Filled circles that react with each other are connected by straight lines. Pooling is shown by lines meeting
below in an open circle.

[0026] A factorable polynomial synthesis is one in which each monomer unit of a round is joined to each monomer
of the preceding round. In a graph of such a synthesis, each filled circle at one level is connected to each filled circle
of the level above. For example, the reaction graph corresponding to a three-round factorable synthesis with

X1 = X2 = X3 = {A,T,G,C}

which yields all 64 trinucleotides, is shown in Fig. 7.
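A factorable synthesis corresponds to a Cartesian product of the per-round monomer sets. The short sketch below checks the count for the three-round example; the variable names mirror the symbols X1, X2, and X3 but are otherwise arbitrary.

```python
from itertools import product

# The same four-nucleotide basis set is used in each of the three rounds.
X1 = X2 = X3 = ("A", "T", "G", "C")

# Every monomer of a round is joined to every product of the preceding round.
trinucleotides = ["".join(p) for p in product(X1, X2, X3)]

print(len(trinucleotides))  # 64
```

The count is the product of the set sizes, 4 x 4 x 4, which is the sense in which the synthesis "factors".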

[0027] In contrast, in an irreducible (prime) polynomial synthesis, at least one line in the graph of the corresponding
factorable polynomial synthesis is missing. Such a synthesis, which yields only AAT, TGC, TGT, GTA, GTG, and CCG,
is illustrated in Fig. 8.
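One way to see that the six-product example does not factor is to compare the product set with the Cartesian product of the monomers observed at each position. The sketch below assumes that reading of "factorable" (a product set equal to the full Cartesian product of its per-position monomer sets); the helper name `is_factorable` is introduced here for illustration.

```python
from itertools import product


def is_factorable(products: set[str]) -> bool:
    """True when the products are exactly the Cartesian product of the
    monomer sets seen at each position (all sequences of equal length)."""
    position_sets = [set(column) for column in zip(*products)]
    full = {"".join(seq) for seq in product(*position_sets)}
    return full == products


fig8_products = {"AAT", "TGC", "TGT", "GTA", "GTG", "CCG"}
print(is_factorable(fig8_products))  # False
```

All four nucleotides occur at every position of the six sequences, so the corresponding factorable synthesis would give 64 products; 58 of them are absent, so the synthesis of Fig. 8 cannot be written as a single product of per-round sets.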

II. Overall Description 

[0028] Fig. 1a is an overall illustration of one aspect of the invention. As shown therein, monomers A and B, which
form all or part of a first basis set of monomers, are coupled to substrates 2 in vessels 4a and 4b. The substrates in
each of the vessels 4a and 4b are divided. A portion of the substrates from each of vessels 4a and 4b are mixed in
vessel 6, and divided for a subsequent coupling step into vessels 6a and 6b. Another fraction of the substrates from
vessels 4a and 4b is not mixed, as indicated by vessels 10 and 12.

[0029] Thereafter, the substrates are coupled to monomers from a second basis set C,D, which may or may not be
the same as the basis set A,B. As shown, the monomer C is coupled to the mixed or "pooled" substrates in vessel 6a,
while the monomer D is coupled to the "pooled" substrates in vessel 6b. A portion of the products of these reactions
may be mixed for later coupling steps, but at least a portion of the products in vessels 6a and 6b are not mixed.

[0030] The products in vessels 10 and 12 are preferably each divided for coupling to monomer C as shown in vessels
10b and 12b, while the substrates in vessels 10a and 12a are used to couple the monomer D to the growing polymer
chain. The products of the reactions in vessels 10a and 10b are mixed or pooled, and placed in vessel 20. The products
of the reactions in vessels 12a and 12b are mixed or pooled, and placed in vessel 22.

[0031] The products in vessels 20 and 22 are, thereafter, used to identify a first monomer in a polymer which is
complementary to a receptor of interest. It is assumed for the sake of illustration herein that the monomer sequence
AC is complementary to the receptor R. A receptor labeled with, for example, a fluorescent or radioactive label is
exposed to the materials in vessels 20 and 22, and unbound receptor is separated from the solid supports. Binding to
the substrates will occur only with the substrates in vessel 20. Fluorescence is, therefore, observed only in vessel 20.
From this observation, it is possible to conclude that the first monomer in a complementary sequence is A, since all of
the polymers in vessel 22 contain the first monomer B. Conversely, all of the polymers in vessel 20 contain the first
monomer A.

[0032] The labeled receptor is also exposed to the polymers in vessels 26a and 26b. In this case, binding of the
labeled receptor will be observed only in vessel 26a. Accordingly, it is possible to identify the second monomer in a
complementary sequence as C, since none of the polymers in vessel 26b contain the second monomer C, while all of
the polymers in vessel 26a contain the second monomer C. Therefore, it is possible to conclude that the sequence AC
is complementary to R since binding is observed in vessels 26a and 20.
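The two-position readout described above can be traced in code. The sketch below is a simplified model, assuming that vessels 26a and 26b hold the unmixed products formed in vessels 6a and 6b, that a vessel gives a signal whenever it contains the complementary sequence, and that exactly one sequence (AC) binds; the dictionary layout and the function `identify` are introduced here for illustration.

```python
# Group defined at the first position: vessel 20 (first monomer A), vessel 22 (first monomer B).
first_position_group = {
    "A": {"AC", "AD"},  # vessel 20: pooled products of vessels 10a and 10b
    "B": {"BC", "BD"},  # vessel 22: pooled products of vessels 12a and 12b
}
# Group defined at the second position: vessels 26a and 26b (assumed contents).
second_position_group = {
    "C": {"AC", "BC"},  # vessel 26a: C coupled to pooled A/B substrates
    "D": {"AD", "BD"},  # vessel 26b: D coupled to pooled A/B substrates
}


def identify(group: dict[str, set[str]], complementary: str) -> str:
    """Return the defined monomer of the single vessel that shows binding."""
    hits = [monomer for monomer, contents in group.items() if complementary in contents]
    assert len(hits) == 1, "expected binding in exactly one vessel of the group"
    return hits[0]


hidden = "AC"  # sequence assumed complementary to receptor R
sequence = identify(first_position_group, hidden) + identify(second_position_group, hidden)
print(sequence)  # AC
```

Each group answers the question for one position independently, so no resynthesis is needed between readouts.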

[0033] Fig. 1b illustrates aspects of a preferred embodiment of the invention in greater detail with a larger polymer
chain. According to the embodiment shown in Fig. 1b, a basis set of 3 monomers, A, B, and C is used in each coupling
step. The synthesized polymers are to be three monomers long. It will be recognized by those of skill in the art that
the number of monomers in a basis set and the number of coupling steps will vary widely from one application to
another. Also, intervening coupling steps of, for example, common monomer sequences may be used in some
embodiments. Therefore, when a polymer is represented by, for example, the notation "ABC" or "ABE" herein, it is to be
understood that other common monomers may be added such that ABDC and ABDE are represented by ABC and
ABE. The embodiment shown in Fig. 1b is provided merely as an illustration of the invention.

[0034] As shown in Fig. 1b, the synthesis takes place on a plurality of substrates 2. According to a preferred aspect
of the invention, the substrates 2 take the form of beads, such as those made of glass, resins, plastics, or the like. The
term "beads" is used interchangeably herein with the word "substrate," although it is to be understood that the beads
need not take on a circular or ovoid shape and can take the form of any suitable substrate. It will be further understood
that the substrates 2 are shown only in the top portion of Fig. 1b, but the substrates will be present in each of the
reaction products shown in Fig. 1b to the left of the monomer sequences. In each vessel in Fig. 1b, all of the possible
polymer products are listed. Many "copies" of each sequence will generally be present.

[0035] According to one embodiment, conventional Merrifield techniques are used for the synthesis of peptides, such
as described in Atherton et al., "Solid Phase Peptide Synthesis," IRL Press, (1989), previously incorporated herein by
reference for all purposes. Of course other synthesis techniques will be suitable when different monomers are used.
For example, the techniques described in Gait et al., Oligonucleotide Synthesis, previously incorporated
herein by reference for all purposes, will be used when the monomers to be added to the growing polymer chain are
nucleotides. These techniques are only exemplary, and other more advanced techniques will be used in some
embodiments such as those for reversed and cyclic polymer synthesis disclosed in U.S. application Serial No. 07/796,727,
previously incorporated herein by reference for all purposes.

[0036] A large number of beads are utilized such that the beads may be separated into separate reaction vessels in
later steps and still be present in sufficient numbers such that the presence of a complementary receptor may be
detected. As a general rule, it will be desired to use 10 to 100 or more times the number of combinatorial possibilities
for the synthesis so as to ensure each member of each set is synthesized. Also, the use of a large number of beads
ensures that pooled reaction products are distributed to each succeeding reaction vessel when a pooled group of
beads is divided.
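The rationale for the 10 to 100 fold excess can be illustrated with a simple occupancy estimate. The sketch below assumes a statistical model that is not stated in the specification: each bead is taken to end up with one of the N possible sequences independently and with equal probability, so the chance that a given sequence is carried by no bead is (1 - 1/N) raised to the number of beads. The function name `expected_missing` is hypothetical.

```python
def expected_missing(n_sequences: int, oversampling: float) -> float:
    """Expected number of sequences carried by no bead, assuming each bead
    independently receives a uniformly random sequence (model assumption)."""
    n_beads = int(oversampling * n_sequences)
    p_missing = (1 - 1 / n_sequences) ** n_beads
    return n_sequences * p_missing


# 27 possible trimers from a three-monomer basis set, as in Fig. 1b.
for factor in (1, 10, 100):
    print(factor, expected_missing(27, factor))
```

Under this model a one-fold bead count leaves a substantial fraction of sequences unrepresented, while a ten-fold or greater excess makes a missing sequence unlikely, consistent with the general rule stated above.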

[0037] The beads are preferably as small as possible so that the reaction vessels and other material handling
equipment utilized in the process may also be as small as possible. Preferably, the beads have a diameter of less than about
1 mm, and preferably less than about 100 µm, and more preferably less than about 10 µm. In some embodiments, the
synthesis is carried out in solution. In other embodiments, the synthesis is carried out on solid substrates, and the
resulting polymers are then cleaved from the substrates before binding with a receptor.

[0038] As shown in Fig. 1b, the monomers A, B, and C are coupled to substrates in three reaction vessels 4a, 4b,
and 4c, respectively. A single substrate is shown in Fig. 1b for purposes of clarity, but it will be recognized that in each
reaction vessel a large number of beads will be present. Accordingly, a large number of "copies" of the substrates with
the respective monomers coupled thereto are formed in each of reaction vessels 4a, 4b, and 4c. It will be recognized
that the monomers need not be directly coupled to the substrate, and in most cases linker molecules will be provided
between the monomers and the substrate, such as those described in U.S. application Serial No. 07/624,120,
incorporated herein by reference for all purposes. Also, it should be recognized that the steps shown in Fig. 1b may be
preceded by or followed by other synthesis steps which may or may not be combinatorial steps using the techniques
described herein.

[0039] Thereafter, a fraction of the products in each of vessels 4a, 4b, and 4c are combined, mixed, and redistributed
to each of reaction vessels 6a, 6b, and 6c. The remaining fraction of the products in each of vessels 4a, 4b, and 4c is
not combined. Instead, the remaining fraction of the products in reaction vessel 4a is divided and placed in reaction
vessels 8a, 8b, and 8c. Similarly, the remaining fraction of the products in vessel 4b is divided and placed in vessels
10a, 10b, and 10c. The remaining fraction of the products in reaction vessel 4c is divided and placed in reaction vessels
12a, 12b, and 12c.

[0040] The reactants placed in vessels 6a, 6b, and 6c are referred to herein as "pooled" reactants since they comprise
a mixture of the products resulting from the previous coupling step. The reactants placed in vessels 8, 10, and 12, by
contrast, are separate reactants since they are not mixtures of the products from the previous coupling steps. According
to a preferred embodiment of the invention, after the reactants in vessels 8, 10, and 12 are subjected to a separate
coupling step, they are subjected only to pooled coupling steps thereafter. Conversely, in each subsequent coupling
step, the pooled reactants are subjected to a coupling step, and divided for subsequent separate and pooled coupling
steps.

[0041] Preferably, the reactants are divided such that a greater fraction of the beads is distributed for pooled
synthesis. For example, in Fig. 9, 4/5 of the beads would go to the first pooled group 905 while 1/5 would go to the unpooled
group 903.

[0042] Thereafter, the monomers A, B, and C are coupled to the growing polymer chain in reaction vessels 8a, 8b,
and 8c, respectively. The resulting polymers then have the monomer sequence CA, CB, and CC in reaction vessels
8a, 8b, and 8c, respectively. The products of these reactions are then mixed or pooled in reaction vessel 9, and the
mixture is again divided among reaction vessels 14a, 14b, and 14c. The monomers A, B, and C are again coupled to
the growing polymer chains in vessels 14a, 14b, and 14c, respectively. The products of these reactions are again mixed
or pooled and placed in vessel 16a.

[0043] Similarly, the monomers A, B, and C are coupled to the growing polymer chain in reaction vessels 10a, 10b,
and 10c, then mixed in vessel 18, divided, and placed in reaction vessels 20a, 20b, and 20c. Monomers A, B, and C
are coupled to the growing polymer chain in vessels 20a, 20b, and 20c, respectively, mixed, and placed in vessel 16b.
Monomers A, B, and C are also coupled to the growing polymer chain in reaction vessels 12a, 12b, and 12c, respectively,
mixed, and placed in vessel 21. These products are divided for reaction with monomers A, B, and C in vessels 22a,
22b, and 22c, respectively, mixed, and placed in vessel 16c. A characteristic feature of the preferred embodiments of
the present invention should be noted in the right half of Fig. 1b. Specifically, once the products of a reaction are not
pooled (such as in vessels 8, 10, and 12), the products of coupling steps are always pooled thereafter.
[0044] Referring to the left-hand portion of Fig. 1b, the pooled reactants in vessels 6a, 6b, and 6c are coupled to
monomers A, B, and C, respectively, resulting in the products shown in vessels 26a, 26b, and 26c. Since the products
in vessels 26a, 26b, and 26c are derived from a "chain" of pooled reactions, the products are separated for both pooled
and separate reactions. Specifically, a portion of the substrates in vessels 26a, 26b, and 26c are combined, mixed,
and divided for pooled reactions with monomers A, B, and C in vessels 28a, 28b, and 28c, respectively. In addition, the
remaining portion of the products in vessels 26a, 26b, and 26c are separately divided and placed in reaction vessels
30a-c, 32a-c, and 34a-c, respectively. The materials in vessels 30a, 32a, and 34a are coupled to monomer A, the materials
in vessels 30b, 32b, and 34b are coupled to monomer B, and the materials in vessels 30c, 32c, and 34c are coupled
to monomer C. Since the products in vessels 30, 32, and 34 have been preceded by a separate reaction, the
products in vessels 30, 32, and 34 are pooled, or mixed, and placed in vessels 36a, 36b, and 36c, respectively.
[0045] For reasons that will be discussed further below, the vessels in group 42 are used to determine the identity
of the monomer in the first position in a polymer that is complementary to a receptor. The vessels in group 44 are used
to determine the identity of the second monomer in a polymer that is complementary to a receptor. The vessels in
group 46 are used to determine the identity of the third monomer in a polymer that is complementary to a receptor.
[0046] For example, assume that a given receptor is complementary to the monomer sequence ABC, but the
sequence of the complementary polymer is not known ab initio. If the receptor is labelled with an appropriate label such
as fluorescein and placed in each of the vessels in groups 42, 44, and 46, fluorescence will be detected only in vessels
16c, 36b, and 28c since the polymer sequence ABC appears only in these vessels. Fluorescence may be detected
using, for example, the methods described in Mathies et al., U.S. Patent No. 4,979,824, incorporated herein in its
entirety by reference for all purposes.

[0047] Since all of the polymers in vessel 16c have monomer A in the first position, and none of the polymers in
vessels 16a or 16b have monomer A in the first position, it is readily determined that the monomer in the first position
of a complementary polymer is the monomer A. Similarly, since all of the polymers in vessel 36b have the monomer
B in the second position, it is readily determined that the monomer B must occupy the second position of a
complementary polymer sequence. Similarly, since all of the polymers in vessel 28c have a C monomer in the third position,
the complementary polymer must have a C in its third position. Therefore, it would readily be determined that the
complementary sequence to the receptor has the monomer sequence ABC.
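
The position-by-position reasoning above can be sketched in a few lines of code. This is a simplified model, not the vessel layout of Fig. 1b: each group is represented as one vessel per monomer, holding every trimer with that monomer fixed at the group's position, and the vessel reference numerals are not modelled. The names `positional_groups` and `decode` are hypothetical.

```python
from itertools import product

MONOMERS = "ABC"
LENGTH = 3


def positional_groups(monomers, length):
    """groups[pos][m] is the set of sequences in the vessel that fixes monomer m at position pos."""
    groups = []
    for pos in range(length):
        vessels = {}
        for m in monomers:
            vessels[m] = {
                "".join(seq)
                for seq in product(monomers, repeat=length)
                if seq[pos] == m
            }
        groups.append(vessels)
    return groups


def decode(target, groups):
    """For each group, find the single vessel that contains the target (the one that would fluoresce)."""
    return "".join(
        next(m for m, contents in vessels.items() if target in contents)
        for vessels in groups
    )


groups = positional_groups(MONOMERS, LENGTH)
print(decode("ABC", groups))
print(decode("BBA", groups))
```

In this model exactly one vessel per group contains any given trimer, which is why reading one positive vessel from each group recovers the full sequence.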

[0048] As will be seen upon careful examination of the sequences in the vessel groups 42, 44, 46, ambiguities will
generally not arise, regardless of the monomer sequence which is complementary to the receptor of interest. As a
point of comparison, if the receptor of interest is complementary to the sequence BBA, fluorescence would be detected
only in vessels 16b, 36b, and 28a. From this information it becomes clear that the complementary monomer sequence
must be BBA.

[0049] The above embodiment illustrates the synthesis of pooled groups of polymers by way of separation into
separate vessels, followed by coupling and mixing. It will be recognized that this is only for convenience of illustration and
that in some embodiments the pooled groups of polymers will be synthesized under controlled conditions by
simultaneous reaction of each of the monomers to be coupled to the polymers in a single reactor. Further, it will be recognized
that the synthesis steps above will be supplemented in many embodiments by prior, intermediate, and subsequent
coupling steps, which are not illustrated for ease of illustration.

[0050] The above method may be generally illustrated by way of the adoption of appropriate nomenclature. For
example, let $X_i$ denote the set of monomer units that become joined to a growing chain at reaction round $i$. For example,
suppose that

$X_1 = \{L, G\} \qquad X_2 = \{P, Y\} \qquad X_3 = \{R, A\}$

A particular monomer is denoted by $x_{ij}$. For example,

$x_{31} = R$

The reaction products $S_3$ of such a three-round peptide synthesis are concisely represented by

$S_3 = X_3 X_2 X_1$

$S_3$ is determined by expanding a reaction polynomial as described in Fodor et al., Science (1991) 251:767-773,
incorporated herein by reference for all purposes.

$S_3 = (R + A)(P + Y)(L + G)$

and so $S_3$ consists of 8 tripeptides:

RPL, RYL, RPG, RYG, APL, AYL, APG, and AYG

$S_{ij}$ denotes a set of reaction products terminating in monomer unit $x_{ij}$. In the above synthesis, for example,

$S_{12} = G \qquad S_{21} = \{PL, PG\} \qquad S_{32} = \{APL, AYL, APG, AYG\}$
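
Expanding the reaction polynomial amounts to taking a Cartesian product of the monomer sets. The following minimal sketch reproduces the sets given above; the helper name `expand` is hypothetical, and sequences are written last-coupled monomer first, matching the X3 X2 X1 ordering used in the text.

```python
from itertools import product

# Monomer sets for the three rounds, as given in the example above
X1 = ["L", "G"]
X2 = ["P", "Y"]
X3 = ["R", "A"]


def expand(*monomer_sets):
    """Expand a reaction polynomial: one product per choice of one monomer from each set."""
    return ["".join(choice) for choice in product(*monomer_sets)]


# S3 = X3 X2 X1 = (R + A)(P + Y)(L + G)
S3 = expand(X3, X2, X1)
print(S3)

# Products terminating in a particular monomer
S21 = expand([X2[0]], X1)                  # ends in x21 = P
S32 = [p for p in S3 if p[0] == X3[1]]     # ends in x32 = A
print(S21, S32)
```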

[0051] This three-round synthesis can also be represented by a reaction graph, as shown in Fig. 2. Each reaction
product of round $i$ is depicted by a filled dot on the same horizontal level. Each dot of round $i$ is joined to each dot of
the preceding round and to each dot of the succeeding round. For example, the dot denoting $S_{21}$ is joined to the dots
for $S_{11}$ and $S_{12}$, and also to the dots $S_{31}$ and $S_{32}$. Note that dots on a level are never connected to each other because,
by definition, monomer units of a round do not combine with one another.

[0052] It is generally assumed that the products of each round are spatially separate and addressable. Each can
then be readily assayed. However, the number of compounds generated by a combinatorial synthesis can, after a few
rounds, greatly exceed the number of experimentally available bins or vessels. It is then advantageous to pool the
products of one or more rounds of synthesis. For example, a five-round synthesis using the basic set of 20 amino acids
yields $20^5$ or $3.2 \times 10^6$ pentapeptides. In contrast, if the products of the first two rounds are pooled, the subsequent
three rounds yield only 8,000 sets of products. Information is lost in the pooling process, but the number of products
becomes experimentally tractable.
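
The trade-off between the number of bins and the size of the mixture in each bin can be checked with a small calculation. The function name and parameters below are hypothetical and serve only to illustrate the arithmetic in the paragraph above.

```python
def bins_and_mixture_size(n_monomers: int, n_rounds: int, n_pooled_first: int):
    """Return (number of addressable bins, distinct products per bin).

    The first `n_pooled_first` rounds are pooled; the remaining rounds are
    kept spatially separate.
    """
    bins = n_monomers ** (n_rounds - n_pooled_first)
    per_bin = n_monomers ** n_pooled_first
    return bins, per_bin


# Fully separate five-round synthesis with 20 amino acids
print(bins_and_mixture_size(20, 5, 0))

# Same synthesis with the first two rounds pooled
print(bins_and_mixture_size(20, 5, 2))
```

Pooling the first two rounds reduces the bin count from 3.2 million to 8,000, at the cost of each bin holding a mixture of 400 pentapeptides.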

[0053] The above representation of combinatorial synthesis may be modified to take into account the effect of pooling.
Suppose that products of the first two rounds of the three-round synthesis mentioned earlier are pooled. The reaction
graph for a synthesis with pooled steps is shown in Fig. 3. The pooled products of round $i$ are denoted by $P_i$ to distinguish them
from the separate products $S_i$. In a reaction graph, pooling is shown by the convergence of lines from the $S_i$ that are
pooled. $P_i$ is then shown as an open circle.

[0054] In this example,

$P_1 = \{L + G\} \qquad P_2 = \{PL + PG + YL + YG\}$

EP 1 324 045 A2 



$S_3 = X_3 P_2 = \{RPL + RPG + RYL + RYG,\ APL + APG + AYL + AYG\}$

The plus sign joins products that are present in a mixture. In contrast, products separated by commas are located in
separate bins and are spatially addressable. In this example, the pooled products of the second round are located in
one bin, whereas the products after three rounds are located in two bins. One bin contains the mixture
RPL+RPG+RYL+RYG, and the other bin contains the mixture APL+APG+AYL+AYG.
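
The distinction between mixtures (plus signs) and separate bins (commas) can be modelled by representing a synthesis state as a list of bins, each bin being a set of products. The helpers `couple` and `pool` below are hypothetical names used only for this sketch.

```python
def couple(bins, monomers):
    """Couple each monomer in a separate vessel to every existing bin.

    Returns one new bin per (monomer, old bin) pair; the new monomer is
    written at the left, matching the X3 X2 X1 ordering of the text.
    """
    return [{m + p for p in b} for m in monomers for b in bins]


def pool(bins):
    """Mix all bins into a single bin."""
    return [set().union(*bins)]


X1, X2, X3 = ["L", "G"], ["P", "Y"], ["R", "A"]

start = [{""}]                 # bare substrate
P1 = pool(couple(start, X1))   # {L + G}
P2 = pool(couple(P1, X2))      # {PL + PG + YL + YG}
S3 = couple(P2, X3)            # two addressable bins, not pooled
print(P1, P2, S3)
```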

[0055] This reaction graph can be simplified. Suppose that $P_1$ was coupled to an equimolar mixture of $x_{21}$ and $x_{22}$ in
a single bin. If the coupling efficiencies for all species are the same, the amounts and kinds of products obtained would
be the same as that given by coupling with $x_{21}$ and $x_{22}$ in separate bins and then pooling the products. Thus, pooled
products and pooled reactants are formally equivalent provided that the reactions occur in a substantially homogeneous
solution and all coupling efficiencies are substantially the same. Hence, an $X_3 P_2$ synthesis can be most simply
represented by the reaction graph shown in Fig. 4.

[0056] The line joining $P_2$ to $P_1$ means that all products in $P_1$ are coupled equally to all reactants $X_2$, either by (1)
adjusting the concentrations of reactants or (2) driving the reactions to completion in separate bins, followed by pooling.
For beads or other discrete particles, (2) more often applies so that each particle expresses only one kind of product.
[0057] By way of comparison, the synthesis of 180 pentapeptides in Furka et al., "General Method for Rapid Synthesis
of Multicomponent Peptide Mixtures," Int. J. Peptide Protein Res. (1991) 37:487-493, is represented with the above
nomenclature as $S_5 = X_5 P_4$, where $X_1 = \{A\}$, $X_2 = \{E, F, K\}$, $X_3 = \{E, P, K\}$, $X_4 = \{E, F, G, K\}$, and $X_5 = \{E, G, K, L, P\}$. The peptide
combinatorial library synthesis in Houghten et al., "Generation and Use of Synthetic Peptide Combinatorial Libraries
for Basic Research and Drug Discovery," Nature (1991) 354:84-86, is $S_6 = X_6 X_5 P_4$, where each $X_i$ is a set of 18
naturally occurring amino acids. The $S_6$ products are located in $18 \times 18$ or 324 bins, each containing a mixture of $18^4$ =
104,976 hexapeptides. The pooled synthesis in Lam et al., "A new type of synthetic peptide library for identifying
ligand-binding activity," Nature (1991) 354:82-84, is represented using the above nomenclature as $P_5$, where each $X_i$ is a set
of 19 naturally occurring amino acids. $P_5$ is a mixture of $19^5 = 2.48 \times 10^6$ beads, each bearing one kind of peptide.

[0058] In the pooled syntheses of Houghten, Lam, and Furka, all products from round 1 to round n are mixed. In
Furka's synthesis ($X_5 P_4$), the first four rounds are pooled. In an $X_3 P_2$ synthesis, the first two rounds are pooled.
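
The library sizes quoted for these prior syntheses follow from simple products of the monomer set sizes. The sketch below assumes set sizes of 1, 3, 3, 4, and 5 for the five rounds of the Furka synthesis (an assumption chosen because these sizes multiply to the stated 180 pentapeptides); the variable names are hypothetical.

```python
from math import prod

# Furka: assumed sizes of X1..X5; their product is the number of pentapeptides
furka_set_sizes = [1, 3, 3, 4, 5]
furka_total = prod(furka_set_sizes)

# Houghten: two defined positions drawn from 18 amino acids give the bin count
houghten_bins = 18 * 18

# Lam: five pooled rounds over 19 amino acids
lam_peptides = 19 ** 5

print(furka_total, houghten_bins, lam_peptides)
```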
[0059] Representative pooled synthesis techniques according to one preferred embodiment of the invention herein
are shown in Figs. 5a, 5b, and 5c. The symbol $X_{ip}$ means that the reactants of round $i$ have been pooled without
pooling the reaction products of previous rounds. This is achieved by, for example, (1) mixing the reactants $X_i$ or (2)
reacting each member of $X_i$ with each reaction product of $S_{i-1}$, as shown in Fig. 6 for $X_{3p} X_2 X_{1p}$.
[0060] For pentapeptides made of the naturally occurring 20 amino acids, for example, a family of five pooled
synthesis groups according to the invention herein will be particularly useful:

$X_5 X_{4p} X_{3p} X_{2p} X_{1p} \qquad X_{5p} X_4 X_{3p} X_{2p} X_{1p} \qquad X_{5p} X_{4p} X_3 X_{2p} X_{1p}$
$X_{5p} X_{4p} X_{3p} X_2 X_{1p} \qquad X_{5p} X_{4p} X_{3p} X_{2p} X_1$

The products of each of these five synthesis product groups would be located in 20 physically isolated bins. Each bin
would contain a different mixture of 160,000 pentapeptides. As with the trimer illustrated in Fig. 1b, the identity of the
monomers forming a complementary pentamer would be determined unambiguously by identifying which of the 20
bins in each of the five synthesis product groups showed binding to a receptor.

[0061] It is to be recognized that while "bins" are referred to herein for the sake of simplicity, any of a variety of
techniques may be used for physically separating the peptide or other polymer mixtures.

[0062] More specifically, a sequence of monomers in a complementary ligand for a receptor is identified as follows.
For example, consider the family of pooled tripeptide libraries made of the 20 naturally occurring amino acids:

$X_3 X_{2p} X_{1p} \qquad X_{3p} X_2 X_{1p} \qquad X_{3p} X_{2p} X_1$

The most potent amino acid at the left position ($x_{3i}$) is revealed by analysis of the 20 bins of $X_3 X_{2p} X_{1p}$; $x_{2j}$ is determined
by analysis of $X_{3p} X_2 X_{1p}$; and $x_{1k}$ is determined by analysis of $X_{3p} X_{2p} X_1$. The sequence of the most potent tripeptide
is then predicted to be $x_{3i} x_{2j} x_{1k}$. Accordingly, each pooled group in the library reveals the identity of a monomer in a
different position in a complementary polymer.
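
The prediction step can be sketched with a toy potency model. Everything below is hypothetical and for illustration only: a four-letter alphabet stands in for the 20 amino acids, the assay signal of a bin is taken as the sum of the potencies of its members, and potency is assumed to drop ten-fold for each mismatch against an arbitrarily chosen best tripeptide.

```python
from itertools import product

ALPHABET = "ACDE"      # hypothetical small alphabet for speed
LENGTH = 3
BEST = "DAE"           # hypothetical most potent tripeptide


def potency(seq):
    """Toy model: potency falls ten-fold per position that differs from BEST."""
    mismatches = sum(a != b for a, b in zip(seq, BEST))
    return 10.0 ** -mismatches


def bin_signal(pos, monomer):
    """Summed signal of the bin in which `monomer` is fixed at `pos` and all other positions are pooled."""
    return sum(
        potency("".join(seq))
        for seq in product(ALPHABET, repeat=LENGTH)
        if seq[pos] == monomer
    )


# One positional library per position; pick the brightest bin in each
predicted = "".join(
    max(ALPHABET, key=lambda m: bin_signal(pos, m)) for pos in range(LENGTH)
)
print(predicted)
```

Under this additive toy model the brightest bin in each positional library corresponds to the monomer of the best tripeptide at that position. Real binding data need not behave this way, which is why the text says the sequence is "predicted".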

[0063] It will be recognized that it will not always be desirable to determine the identity of the entire sequence of
monomers in a polymer that is complementary to a receptor. Instead, it will only be necessary to determine the identity
of selected monomers in a polymer in some instances. The monomers of interest may be at intermediate locations on
the chain of the polymer, and may be interspersed by other monomers. Accordingly, in a more general sense, the method
herein provides for the synthesis of a library of polymers. The library is used to identify at least two monomers of interest
in the polymer chain.

[0064] For example, the identity of the $x_{2j}$ monomer is determined by analysis of a library of polymers T-$X_2$-I-$X_{1p}$-T,
and the identity of the monomer $x_{1k}$ is determined by analysis of a library of polymers T-$X_{2p}$-I-$X_1$-T, where T indicates
terminal groups on the polymer chain, which may be null groups, and I designates intermediate groups in the polymer
chain, which may also be null groups.

[0065] The method of making the library uses pooled and separate synthesis steps. The polymers have at least two
monomer locations at which it is desired to determine the identity of monomers which provide a polymer with a sequence
complementary to a receptor. The library is synthesized such that the products of a pooled synthesis are separated
and subjected to a separate synthesis and a second pooled synthesis. The products of the separate synthesis are
subjected to a series of pooled syntheses, without any further separate synthesis in preferred embodiments.
Conversely, the products of the second pooled synthesis are divided and subjected to both a separate synthesis and a third
pooled synthesis.

[0066] The synthesis steps result in a library of polymers having at least first and second subsets. The first subset
is used to determine the identity of a monomer or monomers at a first location in the polymer chain which is
complementary to a receptor. The second subset of the library is used to determine the identity of a monomer or monomers
at a second location in the polymer chain which is complementary to a receptor.

[0067] The method uses summated assays to identify optimal sequences. The distribution of activities in the mixture
assayed remains unknown. Only the aggregate activity is determined. More information can be obtained from analyses
of beads or other particles that contain multiple copies of one kind of sequence. The activity of each bead can be
quantitated even though its identity is unknown.

[0068] Suppose that 2 µm diameter beads are used for pooled syntheses. Some pertinent properties of typical beads
are:

Volume = $4.2\ \mu\mathrm{m}^3$
Surface area = $12.6\ \mu\mathrm{m}^2$
Number of target sites = $1.3 \times 10^5$ (assuming 1 per 100 $\mathrm{nm}^2$)
Number of beads per $\mathrm{cm}^3$ = $2.4 \times 10^{11}$
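
These bead properties follow from the geometry of a sphere. The sketch below recomputes them for a 2 µm bead, assuming a site density of one target site per 100 nm^2 (an assumption of this sketch) and treating the beads-per-volume figure as the number of bead volumes that fit in 1 cm^3. Variable names are hypothetical.

```python
import math

diameter_um = 2.0
radius_um = diameter_um / 2

# Sphere geometry
volume_um3 = 4.0 / 3.0 * math.pi * radius_um ** 3
area_um2 = 4.0 * math.pi * radius_um ** 2

# Assumed site density: 1 site per 100 nm^2 (1 um^2 = 1e6 nm^2)
target_sites = area_um2 * 1e6 / 100.0

# Bead volumes per cm^3 (1 cm^3 = 1e12 um^3)
beads_per_cm3 = 1e12 / volume_um3

print(f"volume       = {volume_um3:.1f} um^3")
print(f"surface area = {area_um2:.1f} um^2")
print(f"target sites = {target_sites:.2e}")
print(f"beads/cm^3   = {beads_per_cm3:.2e}")
```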

[0069] Fluorescence measurements of beads flowing rapidly through a laser beam are made using techniques such
as those in U.S. Patent No. 4,979,824, previously incorporated herein by reference for all purposes, which provide
exemplary methods for determining the distribution of activities in a pooled synthesis.

[0070] Assume a light beam diameter of 2 µm is used for detection of fluorescein-labeled beads, at a flow rate of 20
cm/s. The transit time of a bead through the beam is then 10 µs. The emission rate from a single chromophore can be
as high as $10^8\ \mathrm{s}^{-1}$. If 10% of the target sites are occupied, this corresponds to an emission rate of about $10^{12}\ \mathrm{s}^{-1}$, or
$10^7$ emitted photons in 10 µs, which would be easily detected. If 10% of the sample volume is occupied by beads, an
average of one bead would pass through the beam every 0.1 ms. Thus, $10^4$ beads could be analyzed per second. A
library of $3.2 \times 10^6$ beads (each bearing a different pentapeptide) could be analyzed in about 6 minutes.
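
The order-of-magnitude estimates in this paragraph can be checked with a short calculation. The sketch assumes about 1.3 x 10^5 target sites per bead and a single-chromophore emission rate of 10^8 per second; both values are assumptions of this sketch, and the variable names are hypothetical.

```python
# Transit time of a bead through the beam
beam_diameter_um = 2.0
flow_um_per_s = 20.0 * 1e4            # 20 cm/s expressed in um/s
transit_s = beam_diameter_um / flow_um_per_s

# Photon budget per bead (assumed inputs)
target_sites = 1.3e5                  # assumed sites per 2 um bead
occupied_fraction = 0.10
rate_per_chromophore = 1e8            # assumed maximum emission rate, 1/s
bead_emission_rate = target_sites * occupied_fraction * rate_per_chromophore
photons_per_transit = bead_emission_rate * transit_s

# Throughput: one bead every 0.1 ms
beads_per_s = 1.0 / 1e-4
library_size = 3.2e6
analysis_minutes = library_size / beads_per_s / 60.0

print(f"transit time    = {transit_s * 1e6:.0f} us")
print(f"photons/transit = {photons_per_transit:.1e}")
print(f"analysis time   = {analysis_minutes:.1f} min")
```

With these inputs the analysis time comes out slightly above five minutes, consistent with the "about 6 minutes" stated in the text.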

[0071] Alternatively, the beads may be analyzed by spreading them on a surface. For example, $3.2 \times 10^6$ beads would
occupy $1.28 \times 10^7\ \mu\mathrm{m}^2$ if packed together in a square array. In $1.28\ \mathrm{cm}^2$, these beads would occupy 10% of the surface
area. Smaller beads, say 0.2 µm, would give a sufficient fluorescence signal. The advantage of smaller beads is that
higher bead densities could be used, leading to a marked reduction in the time needed for analysis.
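
The surface-coverage figures above can be verified in a few lines, assuming each 2 µm bead occupies a 2 µm x 2 µm square in the packed array. Variable names are hypothetical.

```python
n_beads = 3.2e6
bead_diameter_um = 2.0

# Square packing: each bead occupies a square whose side is the bead diameter
packed_area_um2 = n_beads * bead_diameter_um ** 2

# 1.28 cm^2 expressed in um^2 (1 cm^2 = 1e8 um^2)
surface_area_um2 = 1.28 * 1e8
coverage = packed_area_um2 / surface_area_um2

print(f"packed area = {packed_area_um2:.2e} um^2")
print(f"coverage    = {coverage:.0%}")
```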

[0072] The fluorescence pulse height distribution emerging from either analysis would reveal whether there are many
or few optimal sequences contained within the sample of beads. In the simplest case, a single bright bead is seen in
just one bin of a pooled synthesis. The identity of the best sequence then comes directly from analysis of each pooled
synthesis of the family.

[0073] In other cases, there is a distribution of intensities within several sets of beads. As a general rule, positioned
libraries in which binding is exhibited in multiple bins indicate that a particular position plays a less significant role in
binding. In some embodiments, positions where ambiguity is detected are further evaluated through use of the
VLSIPS™ technique. The VLSIPS™ arrays will vary only those positions wherein the monomer has not been determined
unambiguously. The present invention is used, therefore, to reduce the number of polymers which will be screened
with VLSIPS™ in some embodiments.

[0074] In still other cases, polymer mixtures synthesized in multiple bins are cleaved from their respective beads
and then assayed for activity. The freed polymers are then able to interact with receptors in various orientations. The
activity of such polymers can be assayed by various well-known techniques such as ELISA.

[0075] Fig. 9 provides an alternative description of the invention. As shown therein, at step 901 a collection of
substrates is subjected to pooled and separate coupling steps, resulting in pooled and separate products 905 and 903,
respectively. In comparison with the embodiment shown in Fig. 1b, products 903 are analogous to the products shown
in vessels 8, 10, and 12, and products 905 are analogous to the products in vessel 6. The collection of substrate
products 903 are then subjected to pooled coupling steps 903, 905, 907, and 909, i.e., the subsequent coupling steps
to the separate reactants are only pooled coupling steps. Accordingly, the identity of the monomer in the first position
of a polymer complementary to a receptor is determined by evaluation of the products 907.

[0076] Conversely, the pooled products 905 are divided and subjected to pooled and separate coupling steps 909,
resulting in pooled and separate products 907 and 913, respectively. As with the separate products 903, the separate
products 913 are subjected only to pooled coupling steps thereafter, resulting in pooled products 915 and 917. The
products 917 are used to determine the monomer in a second position in a polymer complementary to a receptor of
interest.

[0077] In the same manner, the pooled products 907 are divided and subjected to pooled and separate coupling
steps 919, resulting in pooled and separate products 923 and 921. The separate products 923 are subjected only to
a pooled reaction thereafter, the products 925 being used to determine the monomer in a third position in a polymer
complementary to a receptor of interest. The pooled products 921 are divided and subjected to pooled and separate
reactions 927, resulting in pooled and separate products 929 and 931. The products 907, 917, 925, 931, and 929 are
used to identify complementary receptors. In the preferred embodiment, the pooled products 927 are first used to
determine if any polymers of interest are present. The separate products 931 are used to determine the identity of a
monomer in a fourth position of a polymer complementary to a receptor.

[0078] As shown in Fig. 9, pooled products that have not been subjected to prior separate reactions are divided and
subjected to pooled and separate reactions according to the invention herein. Conversely, products which result from
a prior separate coupling step are only subjected to pooled coupling steps.
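
The branching rule summarized above (a chain that has been pooled so far may branch into a separate step, after which only pooled steps follow) determines which synthesis histories end up in the library. The sketch below enumerates those histories as strings of "S" (separate) and "P" (pooled); the function name is hypothetical, and the reference numerals of Fig. 9 are not modelled.

```python
def synthesis_families(n_rounds: int):
    """List the coupling histories produced by the divide-and-pool rule.

    Each history has at most one separate ("S") step; every other step is
    pooled ("P"). The fully pooled history is included as the final entry.
    """
    families = []
    for separate_round in range(1, n_rounds + 1):
        steps = ["P"] * n_rounds
        steps[separate_round - 1] = "S"   # the one round kept separate
        families.append("".join(steps))
    families.append("P" * n_rounds)        # chain that was pooled throughout
    return families


print(synthesis_families(3))
print(synthesis_families(4))
```

Each history containing an "S" at round k corresponds to the product group used to identify the monomer coupled in round k.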

[0079] In an alternative embodiment depicted in Figs. 10a-10c, a recursive retrosynthesis is employed to screen a
diverse set of polymers. Unlike the recursive retrosynthesis method of Houghten et al. described supra, this method
identifies the sequence of a "best" polymer by identifying a collection of polymers ("library") containing the bead giving
the strongest signal. Houghten et al., in contrast, identify the entire library, rather than the single polymer (bead), having
the strongest signal. Thus, the technique of Houghten et al. may identify an incorrect monomer in the sequence of
interest because the library containing that monomer gave the strongest signal, while the "best" polymer is located
in a different library. The recursive retrosynthesis embodiment of this invention overcomes this difficulty of the Houghten
et al. technique by identifying the individual polymer giving the strongest signal.

[0080] Referring to Figs. 10a-10c, an example of this process is described for the set of all pentamers formed from
a basis set of 50 monomers. As shown in Fig. 10a, the complete collection of quadramers is synthesized on a number
of beads (e.g., $3 \times 10^8$ beads) by four cycles of alternately dividing, reacting, and pooling the beads. The pool of
quadramers is then divided into 50 bins, each of which is reacted with a different member of the basis set to give 50
bins of pentamers, each containing only those pentamers terminating in a specified monomer. In the notation used
herein, this collection of polymers is represented by $X_{1p} X_{2p} X_{3p} X_{4p} X_{5(1-50)}$. The individual beads in each bin are then
assayed to identify the single best bead (i.e., the bead providing the strongest signal on binding with the receptor of
interest). This may be accomplished in about eight hours by FACS as described above, for example. Having determined
the bin containing the bead providing the strongest signal, the identity of the monomer in the fifth position is known. In
the example of Fig. 10, that monomer is D.

[0081] Next, the complete collection of trimers is formed as before (from, e.g., $6 \times 10^6$ beads) by cycles of dividing,
reacting, and pooling the beads as shown in Fig. 10b. Alternatively, the library of trimers could be set aside after the
third cycle of the previous step (during formation of the complete library of pentamers). At this point, the pooled beads
are divided into 50 bins, each of which is reacted with a different member of the basis set. The quadramers in each
bin are then reacted with D to produce a collection of beads represented by $X_{1p} X_{2p} X_{3p} X_{4(1-50)} D$. The beads in each
bin are then assayed to again identify the bead giving the strongest signal. The bin from which that bead was taken
identifies the monomer at the next position: A in this case. The above process is repeated to produce $X_{1p} X_{2p} X_{3(1-50)} AD$
and identify the monomer at the next position as shown in Fig. 10c. In this example, the next monomer is identified
as Q. The final two monomers of the sequence can be identified in the same fashion. However, it may be more efficient
to simply screen the remaining 2,500 possible pentamers via a VLSIPS™ technique.

[0082] In general, for a polymer of length N synthesized from a basis set of n monomers, the terminal monomer may
be identified by the following procedure. First, a pooled library of substrates is formed such that each substrate has a
different polymer synthesized. The pooled library includes a collection of polymers represented by $X_{1p} X_{2p} \ldots X_{(N-1)p}$.
The library is divided into n separate bins, each of which is then reacted with a different monomer to form a library
$X_{1p} X_{2p} \ldots X_{(N-1)p} X_{N(1-n)}$. Finally, a receptor is exposed to the substrates in each of the n separate bins to identify the bin
containing the polymer which binds the receptor most strongly. This bin provides the identity of the monomer in the Nth
position. The penultimate monomer is identified by a similar procedure from $X_{1p} X_{2p} \ldots X_{(N-1)(1-n)} R$, where R is the terminal
monomer previously identified. Each succeeding monomer can be identified in the same manner. In this example, the
two basis sets used to identify the monomers at positions N and N-1 each contained n members. It will be appreciated
that the monomer basis sets used to identify the monomer at each position on the polymer may be independent of the
other basis sets.
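
The general procedure can be sketched as a loop that fixes one position per round, working inward from the terminal monomer and always following the single strongest bead rather than the strongest bin as a whole. The sketch below uses a hypothetical four-monomer basis set and a hypothetical signal function; exhaustive enumeration stands in for bead-by-bead assay and is practical only for small examples.

```python
from itertools import product


def recursive_retrosynthesis(signal, basis, length):
    """Identify a sequence one position at a time, terminal monomer first.

    Sequences are written first-coupled monomer first, so the terminal
    monomer is the last character. `signal` maps a sequence to a number.
    """
    suffix = ""                                    # monomers identified so far
    for _ in range(length):
        n_pooled = length - len(suffix) - 1        # positions still pooled
        best_monomer, best_signal = None, float("-inf")
        for m in basis:                            # one bin per candidate monomer
            for prefix in product(basis, repeat=n_pooled):   # beads in that bin
                s = signal("".join(prefix) + m + suffix)
                if s > best_signal:                # track the single best bead
                    best_monomer, best_signal = m, s
        suffix = best_monomer + suffix
    return suffix


# Hypothetical assay: signal counts positions that match a hidden target
TARGET = "CQAD"


def toy_signal(seq):
    return sum(a == b for a, b in zip(seq, TARGET))


print(recursive_retrosynthesis(toy_signal, "ACDQ", 4))
```

Because each round keeps only the bin of the brightest individual bead, a bin with many moderately active members cannot displace the bin containing the best polymer.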

[0083] A combinatorial synthesis chamber for conducting the synthesis, pooling, and dividing steps employed in
each cycle of this invention is illustrated in Figs. 11a-11c. Individual chambers 200, each containing an amount of packed
beads 203, are aligned in close proximity to one another to form a two-dimensional array. The reaction chambers 200
are mounted on a base 207 via passages 211. A filter 213 is provided at the base of each coupling chamber 200 to
prevent the beads from leaving the coupling chambers 200 when solutions are drained through passages 211. In
synthesis mode, coupling solutions are introduced through passages 205 while cover 201 is in a closed position (as
shown in Fig. 11a). The contents of each chamber 200 are prevented from contacting adjacent chambers by cover
201. Passages 205 and 211 are controlled by valves or other control mechanisms not shown.

[0084] After the coupling reactions have proceeded to the desired extent, the coupling solutions are drained from
the synthesis chamber through the passages 211. Subsequent washing steps may be necessary before pooling and
redistribution. In such washing steps, washing solutions are introduced through passages 205 while cover 201 is held
in the closed position. After sufficient time has elapsed, the washing solution is drained through passages 211. Multiple
washing steps may be performed as necessary to remove unused coupling solutions from chambers 200. During the
reacting and washing steps, the beads in the reaction chambers may be agitated by rotating or shaking the entire
synthesis chamber.

[0085] As shown in Fig. 11b, pooling and dividing of the beads is accomplished by sliding cover 201 away from base
207 to form a mixing chamber 215. The previously packed beads 203 are then agitated in a fluid such that they mix
within mixing chamber 215. Ultimately, when the agitation is stopped, the beads will settle randomly into various
chambers 200. At that point, the suspension solution can be drained through passages 211, and cover 201 can be lowered
to rest on the tops of chambers 200 as shown in Fig. 11a. During the pooling and dividing stages, the valves in passages
205 and 211 are closed. A sealing means 209 is provided to prevent beads or fluid from leaving the synthesis chamber.

[0086] Fig. 11c shows a top view of a two-dimensional array of reaction chambers 200 mounted on base 207. The
chambers shown are arranged in a 7 x 7 array of 49 chambers.
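
The pool-and-divide cycle performed by the chamber array can be mimicked with a simple random redistribution. This is a hypothetical simulation, not a model of the apparatus itself: beads are labelled by the chamber in which they were last coupled, and each bead is assumed to settle into a uniformly random chamber.

```python
import random


def pool_and_redistribute(chambers, rng):
    """Mix all beads together, then let each settle into a random chamber."""
    pooled = [bead for chamber in chambers for bead in chamber]
    settled = [[] for _ in chambers]
    for bead in pooled:
        settled[rng.randrange(len(settled))].append(bead)
    return settled


# 7 x 7 array of 49 chambers, 100 beads each, labelled by chamber of origin
rng = random.Random(0)                       # fixed seed for reproducibility
chambers = [[origin] * 100 for origin in range(49)]
after = pool_and_redistribute(chambers, rng)

print(sum(len(c) for c in after))            # total bead count is conserved
print(len(set(after[0])))                    # distinct origins found in one chamber
```

With many beads per chamber, each chamber receives beads from most of the previous chambers after a single mixing cycle, which is the property the pooled coupling steps rely on.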

[0087] Fig. 12 illustrates a library of polymers which will be useful in accordance with the invention herein. As shown
therein, the polymers have a number of monomer positions, designated by $p_i$. The polymers have at least two
positions of interest, $p_1$ and $p_2$. $p_1$ and $p_2$ may in some embodiments be separated by various intermediate monomers or
groups, and may also have various terminal groups attached thereto. The polymers are placed in a number of
physically isolated bins or vessels 1002. The bins or vessels 1002 may in fact be attached, such as in a microtiter plate, or
the bins/vessels may be distinct containers such as test tubes, microtiter trays, or the like.

[0088] A first bin 1002a contains polymers with a first monomer $M_1$ in the first position $p_1$ in each of the polymers
therein. However, the polymer molecules in the first bin have a variety of different monomers such as $M_1$, $M_2$, and $M_3$
in a second position $p_2$. In the second bin 1002b a second monomer $M_2$ is in the first position $p_1$ in each of the polymers
therein, while different monomers such as $M_1$, $M_2$, and $M_3$ are in the second position $p_2$. In the third bin 1002c a third
monomer $M_3$ is in the first position $p_1$ in each of the polymers therein, while different monomers such as $M_1$, $M_2$, and
$M_3$ are in the second position $p_2$. The first, second, and third bins comprise all or part of a collection of bins $\ldots X_1 \ldots X_{2p} \ldots$.

[0089] Conversely, fourth bin 1002d contains polymers with a first monomer $M_1$ in the second position $p_2$ in each of
the polymer molecules therein. The polymer molecules in the fourth bin have a variety of different monomers such as
$M_1$, $M_2$, and $M_3$ in their first position $p_1$. In the fifth bin 1002e a second monomer $M_2$ is in the second position $p_2$ in
each of the polymers therein, while different monomers such as $M_1$, $M_2$, and $M_3$ are in the first position $p_1$. In the sixth
bin 1002f a third monomer $M_3$ is in the second position $p_2$ in each of the polymers therein, while different monomers
such as $M_1$, $M_2$, and $M_3$ are in the first position $p_1$. The fourth, fifth, and sixth bins comprise all or part of a collection
of bins $\ldots X_{1p} \ldots X_2 \ldots$.

[0090] In screening studies, the bins 1002a, 1002b, and 1002c are used to determine the identity of the monomer
in position 1 of a polymer that is complementary to a receptor of interest. The bins 1002d, 1002e, and 1002f are used
to determine the identity of the monomer in position 2 of a polymer that is complementary to a receptor of interest.
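
The position-by-position readout described for Fig. 12 can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of the specification: the names `make_bin`, `bin_binds`, and `identify` are hypothetical, the basis set is limited to three monomers, and the binding assay is replaced by a simple membership test on an assumed single complementary polymer.

```python
from itertools import product

MONOMERS = ["M1", "M2", "M3"]  # three-monomer basis set, as in Fig. 12

def make_bin(fixed_pos, fixed_monomer, length=2):
    """All polymers of the given length that carry fixed_monomer at
    fixed_pos; every other position is pooled over the basis set."""
    return [seq for seq in product(MONOMERS, repeat=length)
            if seq[fixed_pos] == fixed_monomer]

def bin_binds(polymers, binder):
    """Hypothetical stand-in for the assay: a bin scores as a hit
    if it contains the complementary polymer."""
    return binder in polymers

def identify(binder, length=2):
    """For each position, list the monomers whose bin gives a hit."""
    return [[m for m in MONOMERS
             if bin_binds(make_bin(pos, m, length), binder)]
            for pos in range(length)]

# Position 0 corresponds to bins 1002a-1002c, position 1 to bins 1002d-1002f.
print(identify(("M2", "M3")))  # [['M2'], ['M3']]
```

In this toy case six bins are assayed instead of nine individual polymers. The saving grows with length, since the number of bins is the polymer length times the basis-set size while the number of polymers grows exponentially.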
[0091] It will be recognized that the polymers which are screened according to the above methods can be of widely
varying length and composition. For example, in preferred embodiments, the polymer molecules are preferably greater
than 3 monomer units long, preferably greater than 5 monomer units long, more preferably greater than 10 monomer
units long, and more preferably more than 20 monomer units long. Although a simplified library is shown in Fig. 12, it
will be recognized that in most embodiments, the library will include additional polymer bins so as to identify the
monomers at more than 3 positions, preferably more than 5 positions, more preferably more than 10 positions, and more
preferably more than 20 positions in a complementary polymer to a receptor.

III. Polynomial Factoring Applied to Screening

[0092] In some embodiments a population of all possible polymers of length n is synthesized. If a receptor is found
to bind with one of the polymers in the mixture, a second synthesis is conducted in which the polymers are "factored,"
i.e., two bins are formed, each having half of the population synthesized initially. It is then determined which of the two
bins shows binding to the receptor, the bin which exhibits binding being referred to as a "target group." Yet another
synthesis is conducted in which two bins are created, each with half of the population of the target group in the earlier
bin. The process is repeated until the sequence of the polymer or polymers that show binding to the receptors is
determined.
[0093] More specifically, the invention provides for the synthesis of a population:



This solution is factored as:

[Equation garbled in the source text. Legible fragments: two summations over X_i, a lower limit i = 1, and a limit n/2.]


[0094] If P1 generates a "hit," P1 is factored. If P2 generates a "hit," P2 is factored. Each synthesis requires only half
the number of polymers made in the prior step.
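
The halving procedure can be pictured as a binary search over the population. The sketch below is an illustration under assumptions rather than the method of the specification: the name `factor_screen` and the `binds` callback are hypothetical, the population is split by list order, and exactly one polymer in the population is assumed to bind.

```python
from itertools import product

def factor_screen(population, binds):
    """Repeatedly split the target group into two halves and keep the
    half that shows binding. Returns the remaining polymer and the
    number of factoring rounds. Assumes exactly one binder."""
    target = list(population)
    rounds = 0
    while len(target) > 1:
        half = len(target) // 2
        first, second = target[:half], target[half:]
        # With a single binder, assaying one half is enough to decide.
        target = first if any(binds(p) for p in first) else second
        rounds += 1
    return target[0], rounds

# Hypothetical example: all 64 trimers over a four-letter basis set.
population = ["".join(s) for s in product("ACGT", repeat=3)]
print(factor_screen(population, lambda p: p == "GAT"))  # ('GAT', 6)
```

Because each round halves the target group, a population of 64 polymers is resolved in 6 rounds, and in general the number of rounds grows with the base-2 logarithm of the population size.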

IV. Conclusion 

[0095] The above description is illustrative and not restrictive. Many variations of the invention will become apparent
to those of skill in the art upon review of this disclosure. Merely by way of example, while the invention is illustrated
primarily with regard to the synthesis of oligonucleotides and peptides, the invention will also find utility in conjunction
with the synthesis and analysis of a wide variety of additional polymers. The scope of the invention should, therefore,
be determined not with reference to the above description, but instead should be determined with reference to the
appended claims along with their full scope of equivalents.

The following aspects correspond to the claims of the parent application, European Patent Application No. 
93910996.3, as originally filed: 

[0096] According to one aspect of the invention, there is provided a polymer library screening kit comprising families
of polymers X3-X2p-X1p, X3p-X2-X1p and X3p-X2p-X1 wherein:

X3p-X2p-X1 comprises a collection of at least first and second polymer mixtures, said first polymer mixture having
a first monomer in a first position of polymer molecules therein, and different monomers in second and third
positions of said polymer molecules therein, and wherein said second polymer mixture has a second monomer in said
first position of polymer molecules therein, and different monomers in second and third positions of said polymer
molecules therein;

X3p-X2-X1p comprises a collection of at least third and fourth polymer mixtures, said third polymer mixture having
a third monomer in said second position and said fourth polymer mixture having a fourth monomer in said second



position, each of said third and fourth polymer mixtures having different monomers in said first and third positions;

and

X3-X2p-X1p comprises a collection of at least fifth and sixth polymer mixtures, said fifth polymer mixture having a
fifth monomer in said third position and said sixth polymer mixture having a sixth monomer in said third position,
each of said fifth and sixth polymer mixtures having different monomers in said first and second positions, wherein
said first, third and fourth monomers are the same or different and said second, fourth, and fifth monomers are the
same or different.

[0097] Preferably the polymer mixtures are selected from the group consisting of mixtures of peptides and mixtures
of oligonucleotides.

[0098] Preferably, the polymers comprise at least four monomers. 

[0099] Preferably, the polymer library further comprises labelled receptor molecules, and means for identifying mix- 
tures of said polymers to which said receptor molecules are bound. 
[0100] Preferably, the receptor molecules are labelled with a fluorescein label. 
[0101] Preferably, the polymers are coupled to a solid substrate.

[0102] According to a further aspect of the invention, there is provided a method of identifying first and second
monomers in a polymer that is complementary to a receptor comprising the steps of:

coupling first and second monomers in a first basis set to individual substrates and mixing substrates to form first
pooled products;

coupling said first and second monomers from said first basis set to individual substrates, and not mixing said
substrates to form at least first and second separate products;

separately coupling first and second monomers from a second basis set to substrates from said first pooled
products and not mixing said substrates to form at least third and fourth separate products;

coupling said first and second monomers from said second basis set to individual substrates from said first separate
products and mixing said substrates to form second pooled products;

coupling said first and second monomers from said second basis set to individual substrates from said second
separate products to form third pooled products; and

exposing a receptor to said third and fourth separate products to identify a second monomer in a polymer which
is complementary to a receptor, and exposing said second and third pooled products to said receptor to identify a
first monomer in a polymer which is complementary to said receptor.

[0103] Preferably, the step of exposing to a receptor is preceded by the step of performing additional steps of coupling 
and mixing to said second pooled products and said third pooled products. 
[0104] Preferably, the method further comprises the step of mixing a portion of said third and fourth separate products
to form fourth pooled products.

[0105] Preferably, the method further comprises the step of separately coupling monomers from a third basis set to 
said fourth pooled products. 

[0106] Preferably the monomers are amino acids. 
[0107] Alternatively, the monomers may be nucleotides.

[0108] Preferably the steps of the method are repeated to screen polymers having at least three monomers therein.
[0109] Preferably, wherein at least one of said first and second monomers cannot be determined unambiguously, 
the method further comprises the steps of: 

synthesizing an array of polymers, said potential complementary polymers using a light-directed synthesis
technique; and

detecting binding of said receptor to said potential complementary polymers.

[0110] According to a further aspect of the invention, there is provided a library of polymers to be used for identification
of a receptor complementary to at least one of said polymers comprising:

a first set of polymers, said first set of polymers having a first monomer in a first position, and a plurality of different
monomers at a second position; and

a second set of polymers, isolated from said first set of polymers, said second set of polymers having a second
monomer in said second position, and a plurality of different monomers in said first position.

[0111] Preferably, the library further comprises a third set of polymers, isolated from said first and second sets of
polymers, said third set of polymers having a third monomer in said first position and a plurality of different monomers
at said second position. The library of polymers may further comprise a fourth set of polymers, isolated from said first,
second and third sets of polymers, said fourth set of polymers having a fourth monomer in said second position and a
plurality of different monomers in said first position.



Claims 

1. A polymer library comprising at least three families of substrate-bound polymers, which polymers each comprise
at least three monomers obtained from a basis set of n monomers and each polymer being bound to a substrate
S via an optional linker L, wherein each family comprises n pools of polymers, where:

the support-bound polymers in a first family are represented by S-L-X1X2pX3p, wherein X1 represents a first
monomer position which is occupied by a different monomer in each of the n pools of polymers, X2p represents
a second monomer position which is occupied by a mixture of monomers, and X3p represents a third monomer
position which is occupied by a mixture of monomers;

the substrate-bound polymers in a second family are represented by S-L-X1pX2X3p, wherein X1p represents
a first monomer position which is occupied by a mixture of monomers, X2 represents a second monomer
position which is occupied by a different monomer in each of the n pools of polymers, and X3p represents a third
monomer position which is occupied by a mixture of monomers;

the substrate-bound polymers in a third family are represented by S-L-X1pX2pX3, wherein X1p represents a
first monomer position which is occupied by a mixture of monomers, X2p represents a second monomer position
which is occupied by a mixture of monomers, and X3 represents a third monomer position which is occupied
by a different monomer in each of the n pools of polymers;

wherein intermediate groups may optionally be coupled between adjacent X moieties of the polymers, and
a terminal group may optionally be coupled to the last X moiety of the polymers;

wherein each family is ordered such that a monomer sequence which binds to a receptor can be identified
from the order of pools in the library.

2. The polymer library as recited in claim 1 wherein the library comprises N families of polymers of length N, wherein
for each additional monomer of polymer length greater than three:

the polymers in each family as set forth in claim 1 comprise an additional residue XNp, wherein XNp represents
an Nth monomer position which is occupied by a mixture of monomers; and

the polymer library comprises an additional family of pools of polymers wherein each pool of the family has a
different terminal monomer and all other monomer positions are occupied by a mixture of monomers.

3. The polymer library as recited in claim 1 wherein each of said families of polymers are selected from families of 
peptides and families of oligonucleotides. 

4. The polymer library as recited in claim 1, wherein said polymers comprise at least four monomers.

5. The polymer library as recited in claim 1, further comprising a labelled receptor, and a means for identifying which
of said families of polymers binds to said labelled receptor.

6. The polymer library as recited in claim 5 wherein said receptor molecules are labelled with a fluorescein label.

7. The polymer library of claim 1, wherein said substrates are beads.

8. A method of identifying three monomers in a polymer that specifically binds to a receptor, the method comprising
the steps of:

providing a polymer library as defined in claim 1;
exposing a receptor to each polymer pool of the polymer library; and
determining the identity of the monomer at a first position from the particular pool in the first family to which
the receptor specifically binds, determining the identity of the monomer at a second position from the particular
pool in the second family to which the receptor specifically binds; and determining the identity of the monomer
at a third position from the particular pool in the third family to which the receptor specifically binds.